Language Model Adaptation with the Use of Presentation Slide Information for Automatic Lecture Transcription
نویسندگان
چکیده
We propose a language model adaptation method with the use of presentation slide information for automatic lecture transcription. N-gram probabilities are rescaled with lecture-dependent unigram probabilities estimated by PLSA using all slides of the lecture. In addition, the N-gram language model is interpolated with a model trained with the Web texts collected via the Web search, using keywords extracted from the slides. Moreover, N-best hypotheses of ASR are rescored using word probabilities enhanced with a cache model using the slide corresponding to each utterance. Experimental evaluations on real lectures show that the proposed method with the combination of the global and local slide information achieves a significant improvement of ASR accuracy.
منابع مشابه
Dynamic language model adaptation using presentation slides for lecture speech recognition
We propose a dynamic language model adaptation method that uses the temporal information from lecture slides for lecture speech recognition. The proposed method consists of two steps. First, the language model is adapted with the text information extracted from all the slides of a given lecture. Next, the text information of a given slide is extracted based on temporal information and used for ...
متن کاملUnsupervised topic adaptation for lecture speech retrieval
We are developing a cross-media information retrieval system, in which users can view specific segments of lecture videos by submitting text queries. To produce a text index, the audio track is extracted from a lecture video and a transcription is generated by automatic speech recognition. In this paper, to improve the quality of our retrieval system, we extensively investigate the effects of a...
متن کاملLanguage Model Adaptation for Lecture Transcription by Document Retrieval
With the spread of MOOCs and video lecture repositories it is more important than ever to have accurate methods for automatically transcribing video lectures. In this work, we propose a simple yet effective language model adaptation technique based on document retrieval from the web. This technique is combined with slide adaptation, and compared against a strong baseline language model and a st...
متن کاملAutomatic slide assignation for language model adaptation
Online multimedia repositories are rapidly growing and imposing themselves as fundamental knowledge assets. This is particularly true in the area of education, where large repositories of video lectures are being built, making education accessible to a wide community of potential students. As with many other repositories, most lectures are not transcribed because of the lack of efficient soluti...
متن کاملAutomatic transcription of lecture speech using topic-independent language modeling
We approach lecture speech recognition with a topicindependent language model and its adaptation. As lecture speech has its characteristic style that is different from newspapers and conversations, dedicated language modeling is needed. The problem is that, although lectures have many keywords specific to the topic and fields, available corpus of each domain is limited in size. Thus, we introdu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007